SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: EVANS, CHRISTOPHER J. 

KEITH, DUANE E.

(ii) TITLE OF INVENTION: OPIOID RECEPTOR GENES 

(iii) NUMBER OF SEQUENCES: 25

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MORRISON & FOERSTER 

(B) STREET: 2000 Pennsylvania Avenue, N.W. Suite 5500 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20006-1888 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/403,260 

(B) FILING DATE: 13-MAR-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: MURASHIGE, KATE H. 

(B) REGISTRATION NUMBER: 29,959 

(C) REFERENCE/DOCKET NUMBER: 22000-20526.21 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 887-1500 

(B) TELEFAX: (202) 887-0763 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Tyr Gly Gly Phe Met 
1 5 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Tyr Gly Gly Phe Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 2

(D) OTHER INFORMATION: /note= "D form of amino acid"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Tyr Ser Gly Phe Leu Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 4:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: group(2, 5)

(D) OTHER INFORMATION: /product= "OTHER" 
/note= "D-penicillamine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Tyr Xaa Gly Phe Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 2

(D) OTHER INFORMATION: /note= "D form of amino acid"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5

(D) OTHER INFORMATION: /note= "D form of amino acid"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:



Tyr Ala Gly Phe Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 2

(D) OTHER INFORMATION: /note= "D form of amino acid"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 4

(D) OTHER INFORMATION: /product= "MePhe"
/note= "N-Methylphenylalanine"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5

(D) OTHER INFORMATION: /product= "Gly-ol"
/note= "Carboxy end of glycine has been replaced with an alcohol
substituent"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Tyr Ala Gly Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1829 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 29..1144

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:



GCACGGTGGA GACGGACACG GCGGCGCC ATG GAG CTG GTG CCC TCT GCC CGT 52
Met Glu Leu Val Pro Ser Ala Arg
1 5

GCG GAG CTG CAG TCC TCG CCC CTC GTC AAC CTC TCG GAC GCC TTT CCC 100
Ala Glu Leu Gln Ser Ser Pro Leu Val Asn Leu Ser Asp Ala Phe Pro
10 15 20

AGC GCC TTC CCC AGC GCG GGC GCC AAT GCG TCG GGG TCG CCG GGA GCC 148
Ser Ala Phe Pro Ser Ala Gly Ala Asn Ala Ser Gly Ser Pro Gly Ala
25 30 35 40
CGT AGT GCC TCG TCC CTC GCC CTA GCC ATC GCC ATC ACC GCG CTC TAC 196
Arg Ser Ala Ser Ser Leu Ala Leu Ala Ile Ala Ile Thr Ala Leu Tyr
45 50 55

TCG GCT GTG TGC GCA GTG GGG CTT CTG GGC AAC TGT CTC GTC ATG TTT 244
Ser Ala Val Cys Ala Val Gly Leu Leu Gly Asn Cys Leu Val Met Phe
60 65 70

GGC ATC GTC CGG TAC ACC AAA TTG AAG ACC GCC ACC AAC ATC TAC ATC 292
Gly Ile Val Arg Tyr Thr Lys Leu Lys Thr Ala Thr Asn Ile Tyr Ile
75 80 85

TTC AAT CTG GCT TTG GCT GAT GCG CTG GCC ACC AGC ACG CTG CCC TTC 340
Phe Asn Leu Ala Leu Ala Asp Ala Leu Ala Thr Ser Thr Leu Pro Phe
90 95 100

CAG AGC GCC AAG TAC TTG ATG GAA ACG TGG CCG TTT GGC GAG CTG CTG 388
Gln Ser Ala Lys Tyr Leu Met Glu Thr Trp Pro Phe Gly Glu Leu Leu
105 110 115 120

TGC AAG GCT GTG CTC TCC ATT GAC TAC TAC AAC ATG TTC ACT AGC ATC 436
Cys Lys Ala Val Leu Ser Ile Asp Tyr Tyr Asn Met Phe Thr Ser Ile
125 130 135

TTC ACC CTC ACC ATG ATG AGC GTG GAC CGC TAC ATT GCT GTC TGC CAT 484
Phe Thr Leu Thr Met Met Ser Val Asp Arg Tyr Ile Ala Val Cys His
140 145 150

CCT GTC AAA GCC CTG GAC TTC CGG ACA CCA GCC AAG GCC AAG CTG ATC 532
Pro Val Lys Ala Leu Asp Phe Arg Thr Pro Ala Lys Ala Lys Leu Ile
155 160 165

AAT ATA TGC ATC TGG GTC TTG GCT TCA GGT GTC GGG GTC CCC ATC ATG 580
Asn Ile Cys Ile Trp Val Leu Ala Ser Gly Val Gly Val Pro Ile Met
170 175 180

GTC ATG GCA GTG ACC CAA CCC CGG GAT GGT GCA GTG GTA TGC ATG CTC 628
Val Met Ala Val Thr Gln Pro Arg Asp Gly Ala Val Val Cys Met Leu
185 190 195 200

CAG TTC CCC AGT CCC AGC TGG TAC TGG GAC ACT GTG ACC AAG ATC TGC 676
Gln Phe Pro Ser Pro Ser Trp Tyr Trp Asp Thr Val Thr Lys Ile Cys
205 210 215

GTG TTC CTC TTT GCC TTC GTG GTG CCG ATC CTC ATC ATC ACG GTG TGC 724
Val Phe Leu Phe Ala Phe Val Val Pro Ile Leu Ile Ile Thr Val Cys
220 225 230

TAT GGC CTC ATG CTA CTG CGC CTG CGC AGC GTG CGT CTG CTG TCC GGT 772
Tyr Gly Leu Met Leu Leu Arg Leu Arg Ser Val Arg Leu Leu Ser Gly
235 240 245

TCC AAG GAG AAG GAC CGC AGC CTG CGG CGC ATC ACG CGC ATG GTG CTG 820
Ser Lys Glu Lys Asp Arg Ser Leu Arg Arg Ile Thr Arg Met Val Leu
250 255 260

GTG GTG GTG GGC GCC TTC GTG GTG TGC TGG GCG CCC ATC CAC ATC TTC 868
Val Val Val Gly Ala Phe Val Val Cys Trp Ala Pro Ile His Ile Phe
265 270 275 280

GTC ATC GTC TGG ACG CTG GTG GAC ATC AAT CGG CGC GAC CCA CTT GTG 916
Val Ile Val Trp Thr Leu Val Asp Ile Asn Arg Arg Asp Pro Leu Val
285 290 295
GTG GCC GCA CTG CAC CTG TGC ATT GCG CTG GGC TAC GCC AAC AGC AGC 964
Val Ala Ala Leu His Leu Cys Ile Ala Leu Gly Tyr Ala Asn Ser Ser
300 305 310

CTC AAC CCG GTT CTC TAC GCC TTC CTG GAC GAG AAC TTC AAG CGC TGC 1012
Leu Asn Pro Val Leu Tyr Ala Phe Leu Asp Glu Asn Phe Lys Arg Cys
315 320 325

TTC CGC CAG CTC TGT CGC ACG CCC TGC GGC CGC CAA GAA CCC GGC AGT 1060
Phe Arg Gln Leu Cys Arg Thr Pro Cys Gly Arg Gln Glu Pro Gly Ser
330 335 340

CTC CGT CGT CCC CGC CAG GCC ACC ACG CGT GAG CGT GTC ACT GCC TGC 1108
Leu Arg Arg Pro Arg Gln Ala Thr Thr Arg Glu Arg Val Thr Ala Cys
345 350 355 360

ACC CCC TCC GAC GGC CCG GGC GGT GGC GCT GCC GCC TGACCTACCC 1154
Thr Pro Ser Asp Gly Pro Gly Gly Gly Ala Ala Ala
365 370

GACCTTCCCC TTAAACGCCC CTCCCAAGTG AAGTGATCAG AGGCCACACC GAGCTCCCTG 1214

GGAGGCTGTG GCCACCACCA GGACAGCTAG AATTGGGCCT GCACAGAGGG GAGGCCTCCT 1274

GTGGGGACGG GCCTGAGGGA TCAAAGGCTC CAGGTTGGAA CGGTGGGGGT GAGGAAGCAG 1334

AGCTGGTGAT TCCTAAACTG TATCCATTAG TAAGGCCTCT CAATGGGACA GAGCCTCCGC 1394

CTTGAGATAA CATCGGGTTC TGGCCTTTTT GAACACCCAG CTCCAGTCCA AGACCCAAGG 1454

ATTCCAGCTC CAGAACCAGG AGGGGCAGTG ATGGGGTCGA TGATTTGGTT TGGCTGAGAG 1514

TCCCAGCATT TGTGTTATGG GGAGGATCTC TCATCTTAGA GAAGAAAGGG GACAGGGCAT 1574

TCAGGCAAGG CAGCTTGGGG TTTGGTCAGG AGATAAGCGC CCCCCTTCCC TTGGGGGGAG 1634

GATAAGTGGG GGATGGTCAC GTTGGAGAAG AGTCAAAGTT CTCACCACCT TTCTAACTAC 1694

TCAGCTAAAC TCGTTGAGGC TAGGGCCAAC GTGACTTCTC TGTAGAGAGG TACAAGCCGG 1754

GCCTGATGGG GCAGGCCTGT GTAATCCCAG TCATAGTGGA GGCTGAGGCT GGAAAATTAA 1814

GGACCAACAG CCCGG 1829



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Glu Leu Val Pro Ser Ala Arg Ala Glu Leu Gln Ser Ser Pro Leu
1 5 10 15

Val Asn Leu Ser Asp Ala Phe Pro Ser Ala Phe Pro Ser Ala Gly Ala
20 25 30

Asn Ala Ser Gly Ser Pro Gly Ala Arg Ser Ala Ser Ser Leu Ala Leu
35 40 45




Ala Ile Ala Ile Thr Ala Leu Tyr Ser Ala Val Cys Ala Val Gly Leu
50 55 60

Leu Gly Asn Cys Leu Val Met Phe Gly Ile Val Arg Tyr Thr Lys Leu
65 70 75 80

Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu Ala Leu Ala Asp Ala
85 90 95

Leu Ala Thr Ser Thr Leu Pro Phe Gln Ser Ala Lys Tyr Leu Met Glu
100 105 110

Thr Trp Pro Phe Gly Glu Leu Leu Cys Lys Ala Val Leu Ser Ile Asp
115 120 125

Tyr Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu Thr Met Met Ser Val
130 135 140

Asp Arg Tyr Ile Ala Val Cys His Pro Val Lys Ala Leu Asp Phe Arg
145 150 155 160

Thr Pro Ala Lys Ala Lys Leu Ile Asn Ile Cys Ile Trp Val Leu Ala
165 170 175

Ser Gly Val Gly Val Pro Ile Met Val Met Ala Val Thr Gln Pro Arg
180 185 190

Asp Gly Ala Val Val Cys Met Leu Gln Phe Pro Ser Pro Ser Trp Tyr
195 200 205

Trp Asp Thr Val Thr Lys Ile Cys Val Phe Leu Phe Ala Phe Val Val
210 215 220

Pro Ile Leu Ile Ile Thr Val Cys Tyr Gly Leu Met Leu Leu Arg Leu
225 230 235 240

Arg Ser Val Arg Leu Leu Ser Gly Ser Lys Glu Lys Asp Arg Ser Leu
245 250 255

Arg Arg Ile Thr Arg Met Val Leu Val Val Val Gly Ala Phe Val Val
260 265 270

Cys Trp Ala Pro Ile His Ile Phe Val Ile Val Trp Thr Leu Val Asp
275 280 285

Ile Asn Arg Arg Asp Pro Leu Val Val Ala Ala Leu His Leu Cys Ile
290 295 300

Ala Leu Gly Tyr Ala Asn Ser Ser Leu Asn Pro Val Leu Tyr Ala Phe
305 310 315 320

Leu Asp Glu Asn Phe Lys Arg Cys Phe Arg Gln Leu Cys Arg Thr Pro
325 330 335

Cys Gly Arg Gln Glu Pro Gly Ser Leu Arg Arg Pro Arg Gln Ala Thr
340 345 350

Thr Arg Glu Arg Val Thr Ala Cys Thr Pro Ser Asp Gly Pro Gly Gly
355 360 365

Gly Ala Ala Ala
370 
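
As a consistency check on the listing, the CDS location given for SEQ ID NO: 7 (29..1144) can be compared with the 372-residue length stated for SEQ ID NO: 8. The following is a minimal Python sketch and is not part of the filed listing. The function name `cds_codon_count` is hypothetical, and the sketch assumes the CDS range is 1-based, inclusive, and excludes the stop codon (the TGA that follows position 1144 in SEQ ID NO: 7).

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of codons in a 1-based, inclusive CDS range.

    Assumption (not stated in the listing): the range excludes the stop
    codon, so every codon corresponds to one amino acid residue.
    """
    length = end - start + 1  # inclusive coordinates
    if length <= 0:
        raise ValueError("CDS end must not precede CDS start")
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


if __name__ == "__main__":
    # CDS location from SEQ ID NO: 7; SEQ ID NO: 8 is stated to be 372 residues.
    print(cds_codon_count(29, 1144))
```

Under these assumptions the range 29..1144 spans 1116 nucleotides, or 372 codons, which agrees with the length recorded for SEQ ID NO: 8.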
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Glu Leu Thr Ser Glu Gln Phe Asn Gly Ser Gln Val Trp Ile Pro
1 5 10 15

Ser Pro Phe Asp Leu Asn Gly Ser Leu Gly Pro Ser Asn Gly Ser Asn
20 25 30

Gln Thr Glu Pro Tyr Tyr Asp Met Thr Ser Asn Ala Val Leu Thr Phe
35 40 45

Ile Tyr Phe Val Val Cys Val Val Gly Leu Cys Gly Asn Thr Leu Val
50 55 60

Ile Tyr Val Ile Leu Arg Tyr Ala Lys Met Lys Thr Ile Thr Asn Ile
65 70 75 80

Tyr Ile Leu Asn Leu Ala Ile Ala Asp Glu Leu Phe Met Leu Gly Leu
85 90 95

Pro Phe Leu Ala Met Gln Val Ala Leu Val His Trp Pro Phe Gly Lys
100 105 110

Ala Ile Cys Arg Val Val Met Thr Val Asp Gly Ile Asn Gln Phe Thr
115 120 125

Ser Ile Phe Cys Leu Thr Val Met Ser Ile Asp Arg Tyr Leu Ala Val
130 135 140

Val His Pro Ile Lys Ser Ala Lys Trp Arg Arg Pro Arg Thr Ala Lys
145 150 155 160

Met Ile Asn Val Ala Val Trp Gly Val Ser Leu Leu Val Ile Leu Pro
165 170 175

Ile Met Ile Tyr Ala Gly Leu Arg Ser Asn Gln Trp Gly Arg Ser Ser
180 185 190

Cys Thr Ile Asn Trp Pro Gly Glu Ser Gly Ala Trp Tyr Thr Gly Phe
195 200 205

Ile Ile Tyr Ala Phe Ile Leu Gly Phe Leu Val Pro Leu Thr Ile Ile
210 215 220

Cys Leu Cys Tyr Leu Phe Ile Ile Ile Lys Val Lys Ser Ser Gly Ile
225 230 235 240

Arg Val Gly Ser Ser Lys Arg Lys Lys Ser Glu Lys Lys Val Thr Arg
245 250 255

Met Val Ser Ile Val Val Ala Val Phe Ile Phe Cys Trp Leu Pro Phe
260 265 270

Tyr Ile Phe Asn Val Ser Ser Val Ser Val Ala Ile Ser Pro Thr Pro
275 280 285

Ala Leu Lys Gly Met Phe Asp Phe Val Val Ile Leu Thr Tyr Ala Asn
290 295 300 
Ser Cys Ala Asn Pro Ile Leu Tyr Ala Phe Leu Ser Asp Asn Phe Lys
305 310 315 320

Lys Ser Phe Gln Asn Val Leu Cys Leu Val Lys Val Ser Gly Ala Glu
325 330 335

Asp Gly Glu Arg Ser Asp Ser Lys Gln Asp Lys Ser Arg Leu Asn Glu
340 345 350

Thr Thr Glu Thr Gln Arg Thr Leu Leu Asn Gly Asp Leu Gln Thr Ser
355 360 365

Ile



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGGCAGTGGT GTGCATGCTC CAGTTCCCCA GCCCCAGCTG GTACTGGGAC ACGGTGACCA 60 

AGATCTGCGT GTTCCTCTTC GCCTTCGTGG TGCCCATCCT CATCATCACC GTGTGCTATG 120 

GCCTCATGCT 130 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGTGCAGTGG TATGCATGCT CCAGTTCCCC AGTCCCAGCT GGTACTGGGA CACTGTGACC 60 

AAGATCTGCG TGTTCCTCTT TGCCTTCGTG GTGCCGATCC TCATCATCAC GGTGTGCTAT 120

GGCCTCATGC 130



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCTGGCCTTT TGGGGATGTG CTGTGCAAGA TAGTAATTTC CATTGATTAC TACAACATGT 60 
TCACCAGCAT CTTCACCTTG ACCATGATGA GCGTGGACCG CTACATTGCC GTGTGCCACC 120 



CCGTGAAGGC TTTGGACTTC CGCACACCCT TGAAGGCAAA GATCATCAAT ATCTGCATCT 180 
GGCTGCTGTC GTCATCTGTT GGCATCTCTG CAATAGTCCT TGGAGGCACC AAAGTCAGGG 240

AAGGTAAGAG CAGTCATTTC ATfCTGTTCA TAAAAATGTA GCTTCAAATT ACATAGACTT 300 

TTAATTTGAG CGTGAGTAGG CCACATATTT GTGGAAATCG ATGCCAAAAG ACGACGGAAA 360 

TGTAGTGCCT AAATCCATGG AAGATGAGAA GTAGAACAAT TTTTTGTCCC TTTCCACCTC 420 

TAAACACAGA ATGCAATAAT GACATTGCCA GAAGAGAGAT GCCCGACCTG TCTCCCATTC 480 

TGGCAATGTT TAGTAGAAAG TGGAGGGGTG AGGATGAGGT AAGAACCACA GGCATGTAGA 540 

TTTTAAAGTA CAACCTGGCA AGTCCAGACA CACCTTCTCA CTCCTTTTTT TCTCTTTAAC 600 

AAGGGATATA AATTATTGGT GACATATGCT GGTTGTTTCC TCTTTTATTC CTAAAGGATA 660 

ACCTCCAAAT CACTATTTTA ACAGCTTTGG CGTAGGATCT CAAAATCAAG TTAACGGATG 720 

GTAGTTACAG ATGAGTCAGA ACCACTTGAT TTGGACATAT CAGGTTTTCC CTTGCAAACC 780 

AGCCAACTGA tTTTTTTTTT xTTTTTTTTT GAGAGAGAGT CTTGCTCTGT TGCCAGGCTA 840 

GAGTGCAGTG GCGCGATATC GGCTCACTGC AACCTCTGCC TCCCGGGTTC AACCTCAGCC 900 

TCTCGAGTAG CTGGGACTAC TGGCACACAC CACCAfGCCC AGCTAATTTT TGTATTTTTA 960 

GTAGAGACAG GGTTTCACCG TGTTGGCCAG GGTGGTCTCA ATCTCTTGAC CTCGTGATCT 1020 

GCCCGCCTCG NCTCCCCAAA GTGCTGGGAT TACAGGCGTG CNCTGCNCCC GNCCCCTGTT 1080 

GATGTTTTTC CTGTATTTCT AGGACAGTAG TTCTCACTCT GGGCTGCACA TTGGAATCAC 1140 

CTGGGTACTT TAGAAAACAC TGCTGCCTGC ATCCCACCCC TTAAGGGTCT GGTGTAATTG 1200 

ACCTGGGGTA CAGCCTGGGT GTCAAGATTT TTGAGCTCTC TCCAGGTGAC TCTGACCTGC 1260 

AGCCAAGGTG AGAGGTACTG TTCTAGGAGT TTTGCTTTAC TAGCAAAATA TAAAGCTATA 1320

GAAAGCATCT TTTGTTCCTC ATAGAAATTA ATGATGGGGA GGTGAGCAGA ATAGTCACTC 1380 

TGGGCCTACT CATGCTGTTT AATGCTCCAG CAGGTATATA GGTTCTCCAG TTACTAGGGG 1440 

GTTCATAATA CCTGTGAGAG CAGATAACTG AGTGTATATA GTGAGGATTT CCAGGTCATA 1500 

GTGAAAGGGC AAGGCACTAA AATCATAGCT TGTCTTGCAT ATACTGTTTG TTTGTTTTTA 1560 

GACTTACATG TTAGGTTTCA GTTTACGTTT TAGGTTCACA GCAAAACTGA CCAGAAAGCA 1620 

CAGAGAGGCA CTTCNATTTA CCTCCATTTA CCCCACACAG GCACATCCTC CCCTACAGAG 1680 

TGGTCCATTT ATTACAGCTG CTGAACCCAC ACTGACACGC TGTTATCACT CAGAGCCTGG 1740 

CAGTTTACAG AGGCTCACTC TCCGNTATGT GTCCTGTGNT TTGAACAAAT GTATAATGAC 1800 

TTTATTCATT GTTTTTTAAT GAAGCTGATC TTTTCCCTCT GAAACTACAA AATGAATTTC 1860 

TAGCATAGCC ATAGCAGGTG TCAAGCTATA CTACTAGGTA AATTTTAAGA AATGCCCAAC 1920 
TTTATCATAT TTGCATTTCA AAATATGATT AATCACACAT AGGATTTTGT TTCTTCATGC 1980

CTACAGCAAA TAGAAATAAA GTGCAAGAAA CTTTTCTGAG GCAAAGCTTT CACTTTGTGA 2040 

ACGTAAAATG TTGACTCTAA TATTTTCCAT ACTGTAGTAT ATGTGTGTGT ATTATGTGAG 2100 

GATTCATAGT CTGCTCTTAC TTTTTTATAG TAGCTAAGAA TTATTATAAT CGCTATAAGC 2160 
AGAAACAATT ATTCTTAACA AAATGAATAC ACACAAGAAA AGCTTTAGTT TAGCTATTAG 2220

AACTAACTCT ATAATTATGA TAACCATGAG ATGCTGGAAC AGGAGCCAGC AGAAGCCACA 2280 

GCCCTCTGAT ATTAATATAT AAAGAAACCA AAATCTGCTT GTTAAACTGA GGCAGTTGTA 2340 

TGGATACTTC AACCTGAAAA TGCCCCCTTC TTCCTGAAAC AGAACATTTA ATAAAAATGG 2400 

CATGCTTGGA CAGGAATTTC TTTTTTAAAA AATGCTTAGT TTTTATG 2447 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



TTCCTTTATC TCCTAGATAC ACCAAGATGA AGACTGCCAC CAACATCTAC ATTTTCAACC 60

TTGCTCTGCA GATGCCTTAG CCACCAGTAC CCTGCCCTTC CAGAGTGTGA ATTACCTAAT 120

GGGAACATGG CCATTTGGAA CCATCCTTTG CAAGATAGTG ATCTCCATAG ATTACTATAA 180

CATGTTCACC AGCATATTCA CCCTCTGCAC CATGAGTGTT GATCGATACA TTGCAGTCTG 240

CCACCCTGTC AAGGCCTTAG ATTTCCGTAC TCCCCNNNNN NNNNNNNNNN NNNNNNNNNN 300

NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 360

NNNNNNNNGT TCCATAGATT GTACACTAAC ATTCTCTCAT CCAACCTGGT ACTGGGAAAA 420

CCTGCTGAAG ATCTGTGTTT TCATCTTCGC CTTCATTATG CCAGTGCTCA TCATTACCGT 480

GTGCTATGGA CTGATGATCT TGCGCCTCAA GAGTGTCCGC ATGCTCTCTG GCTCCAAAGA 540

AAAGGACAGG AATCTTCGAA GGATCACCAG GATGGTGCTG GTGGTGGTGG CTGTGTTCAT 600

CGTCTGCTGG ACTCCCATTC ACATTTACGT CATCATTAAA GCCTTGGTTA CAATCCCAGA 660

AACTACGTTC CAGACTGTTT CTTGGCACTT CTGCATTGCT CTAGGTTACA CAAACAGCTG 720

CCTCAACCCA GTCCTTTATG CATTTCTGGA TGAAAACTTC CACGATGCTT CAGAGAGTTC 780

TGTATCCCAA CCTCTTCCAA CATTGAGCAA CAAAACTCCA CTCGAATTCC 830



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

GGGTACCGGG CCCCCCCTCG AGGTCGACGG TATCGATAAG CTTGATATCG AATTCTTACT 60 

GAATTAGGTA TCTTTCTTCA CACTACTTGG TAAAAAAAAT GAAAAGGCAG AAAAATTAGC 120 

CCCAAAAGAG ATGAAACTCT TCCGTCCATC ACCATTGACT CTATTGTGAA CTTATGAAAA 180 
AGGTAGTTGA GCAATATGAA GGCCATGATG TGGAATTAAA CACACACACA CACACACACA 240

CACACACACA CACATGCTGG ATTCTAAATG TGTCCTTCCT CCTCTCACTC TCTTGATTCA 300 

AGTTTATTTC TGAACTGAGA CACGATCACC AC 332 

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGGATCCTTA GCATCCCCAA AGCGCCTCCG TGTACTTCTA AGGTGGGAGG GGGATACAAG 60 

CAGAGGAGAA TATCGGACGC TCAGACGTTC CATTCTGCCT GCCGCTCTTC TCTGGTTCCA 120

CTAGGGCTTG TCCTTGTAAG AAACTGACGG AGCCTAGGGC AGCTGTGAGA GGAAGAGGCT 180 

GGGGCGCCTG GAACCCGAAC ACTCTTGAGT GCTCTCAGTT ACAGNCTACC GAGTCCGCAG 240

GAAGCATTCA GAACCATGGA CAGCAGCGCC GGCCCAGGGA ACATCAGCGA CTGCTCTGAC 300 

CCCTTAGCTC CTGCAAGTTG CTCCCCAGCA CCTGGCTCCT GGCTCAACTT GTCCCACGTT 360 

GATGGAAACC AGTCCGACCC ATGCGGTCCT AACCCGACGG GCCTTGGCGG GAACGACAGC 420

CTGTGCCCTC AGACCGGCAG CCCTTCCATG GTCACAGCCA TCACCATCAT GGCCCTCTAT 480 

TCTATCGTGT GTGTAGTGGG CCTCTTTGGA AACTTCCTGG TCATGTATGT GATTGTAAGA 540

TATACCAAAA TGAAGACTGC CACCAACATC TACATTTTCA ACCTTGCTCT GGCAGATGCC 600 

TTAGCCACTA GCACGCTGCC CTTTCAGAGT GTTAACTACC TGATGGGAAC GTGGCCCTTT 660 

GGAAACATCC TCTGCAAGAT CGTGATCTCA ATAGACTACT ACAACATGTT CACCAGTATC 720 

TTCACCCTCT GCACCATGAG TGTAGACCGC TACATTGCCG TCTGCCACCC GGTCAAGGCC 780 

CTGGATTTCC GTACCCCCCG AAATGCCAAA ATTGTCAATG TCTGCAACTG GATCCTCTCT 840

TCTGCCATTG GTCTGCCCGT AATGTTCATG GCAACCACAA AATACAGGCA GGGGTCCATA 900 

GATTGCACCC TCACGTTCTC TCATCCCACA TGGTACTGGG AGAACCTGCT CAAAATCTGT 960 

GTCTTCATCT TCGCCTTCAT CATGCCGGGC CTCATCATCA CTGTGTGTTA TGGACTGATG 1020 

ATCTTACAGC TCAAGAGTGT CCGCATGCTG TCGGGCTCCA AAGAAAAGGA CAGGAACCTG 1080 

CGCAGGATCA CCCGGATGGT GCTGGTGGTC GTGGCTGTAT TTATTGTCTG CTGGACCCCC 1140 

ATCCACATCT ATGTCATCAT CAAAGCACTG ATCACGATTC CAGAAACCAC TTTCCAGACT 1200 

GTTTCCTGGC ACTTCTGCAT TGCCTTGGGT TACACAAACA GCTGCCTGAA CCCAGTTCTT 1260 

TATGCGTTCC TGGATGAAAA CTTCAAACGA TGTTTTAGAG AGTTCTGCAT CCCAACTTCC 1320

TCCACAATCG AACAGCAAAA CTCTGCTCGA ATCCGTCAAA ACACTAGGGA ACACCCCTCC 1380 

ACGGCTAATA CAGTGGATCG AACTAACCAC CAGCTAGAAA ATCTGGAAGC AGAAACTGCT 1440 

CCATTGCCCT AACTGGGTCC CACGCCATCC AGACCCTCGC TAAACTTAGA GGCTGCCATC 1500 
TACTTGGAAT CAGGTTGCTG TCAGGGTTTG TGGGAGGCTC TGGTTTCCTG GAAAAGCATC 1560

TGATCCTGCA TCATTCAAAG TCATTCCTCT CTGGCTATTC ACGCTACACG TCAGAGACAC 1620 

TCAGACTGTG TCAAGCACTC AGAAGGAAGA GACTGCAGGC CACTACTGAA TCCAGCTCAT 1680 

GTACAGAAAC ATCCAATGGA CCACAATACT CTGTGGTATG TGATTTGTGA TCAACATAGA 1740 

AGGTGACCCT TCCCTATGTG GAATTTTTAA TTTCAAGGAA ATACTTATGA TCTCATCAAG 1800 

GGAAAAATAG ATGTCACTTG TTAAATTCAC TGTAGTGATG CATAAAGGAA AAGCTACCTC 1860 

TGACCTCTAG CCCAGTCACC CTCTATGGAA AGTTCCATAG GGAATATGTG AGGGAAAATG 1920 

TTGCTTCCAA ATTAAATTTT CACCTTTATG TTATAGTCTA GTTAAGACAT CAGGGGCATC 1980 

T 1981 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 398 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Asp Ser Ser Thr Gly Pro Gly Asn Thr Ser Asp Cys Ser Asp Pro
1 5 10 15

Leu Ala Gln Ala Ser Cys Ser Pro Ala Pro Gly Ser Trp Leu Asn Leu
20 25 30

Ser His Val Asp Gly Asn Gln Ser Asp Pro Cys Gly Leu Asn Arg Thr
35 40 45

Gly Leu Gly Gly Asn Asp Ser Leu Cys Pro Gln Thr Gly Ser Pro Ser
50 55 60

Met Val Thr Ala Ile Thr Ile Met Ala Leu Tyr Ser Ile Val Cys Val
65 70 75 80

Val Gly Leu Phe Gly Asn Phe Leu Val Met Tyr Val Ile Val Arg Tyr
85 90 95

Thr Lys Met Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu Ala Leu
100 105 110

Ala Asp Ala Leu Ala Thr Ser Thr Leu Pro Phe Gln Ser Val Asn Tyr
115 120 125

Leu Met Gly Thr Trp Pro Phe Gly Thr Ile Leu Cys Lys Ile Val Ile
130 135 140

Ser Ile Asp Tyr Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu Cys Thr
145 150 155 160

Met Ser Val Asp Arg Tyr Ile Ala Val Cys His Pro Val Lys Ala Leu
165 170 175

Asp Phe Arg Thr Pro Arg Asn Ala Lys Ile Val Asn Val Cys Asn Trp
180 185 190
Ile Leu Ser Ser Ala Ile Gly Leu Pro Val Met Phe Met Ala Thr Thr
195 200 205

Lys Tyr Arg Gln Gly Ser Ile Asp Cys Thr Leu Thr Phe Ser His Pro
210 215 220

Thr Trp Tyr Trp Glu Asn Leu Leu Lys Ile Cys Val Phe Ile Phe Ala
225 230 235 240

Phe Ile Met Pro Ile Leu Ile Ile Thr Val Cys Tyr Gly Leu Met Ile
245 250 255

Leu Arg Leu Lys Ser Val Arg Met Leu Ser Gly Ser Lys Glu Lys Asp
260 265 270

Arg Asn Leu Arg Arg Ile Thr Arg Met Val Leu Val Val Val Ala Val
275 280 285

Phe Ile Val Cys Trp Thr Pro Ile His Ile Tyr Val Ile Ile Lys Ala
290 295 300

Leu Ile Thr Ile Pro Glu Thr Thr Phe Gln Thr Val Ser Trp His Phe
305 310 315 320

Cys Ile Ala Leu Gly Tyr Thr Asn Ser Cys Leu Asn Pro Val Leu Tyr
325 330 335

Ala Phe Leu Asp Glu Asn Phe Lys Arg Cys Phe Arg Glu Phe Cys Ile
340 345 350

Pro Thr Ser Ser Thr Ile Glu Gln Gln Asn Ser Thr Arg Val Arg Gln
355 360 365

Asn Thr Arg Glu His Pro Ser Thr Ala Asn Thr Val Asp Arg Thr Asn
370 375 380

His Gln Leu Glu Asn Leu Glu Ala Glu Thr Ala Pro Leu Pro
385 390 395



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Met Glu Ser Pro Ile Gln Ile Phe Arg Gly Asp Pro Gly Pro Thr Cys
1 5 10 15

Ser Pro Ser Ala Cys Leu Leu Pro Asn Ser Ser Ser Trp Phe Pro Asn
20 25 30

Trp Ala Glu Ser Asp Ser Asn Gly Ser Val Gly Ser Glu Asp Gln Gln
35 40 45

Leu Glu Ser Ala His Ile Ser Pro Ala Ile Pro Val Ile Ile Thr Ala
50 55 60

Val Tyr Ser Val Val Phe Val Val Gly Leu Val Gly Asn Ser Leu Val 
65 70 75 80 
Met Phe Val Ile Ile Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn Ile
85 90 95

Tyr Ile Phe Asn Leu Ala Leu Ala Asp Ala Leu Val Thr Thr Thr Met
100 105 110

Pro Phe Gln Ser Ala Val Tyr Leu Met Asn Ser Trp Pro Phe Gly Asp
115 120 125

Val Leu Cys Lys Ile Val Ile Ser Ile Asp Tyr Tyr Asn Met Phe Thr
130 135 140

Ser Ile Phe Thr Leu Thr Met Met Ser Val Asp Arg Tyr Ile Ala Val
145 150 155 160

Cys His Pro Val Lys Ala Leu Asp Phe Arg Thr Pro Leu Lys Ala Lys
165 170 175

Ile Ile Asn Ile Cys Ile Trp Leu Leu Ala Ser Ser Val Gly Ile Ser
180 185 190

Ala Ile Val Leu Gly Gly Thr Lys Val Arg Glu Asp Val Ile Glu Cys
195 200 205

Ser Leu Gln Phe Pro Asp Asp Glu Trp Trp Asp Leu Phe Met Lys Ile
210 215 220

Cys Val Phe Val Phe Ala Phe Val Ile Pro Val Leu Ile Ile Ile Val
225 230 235 240

Cys Tyr Thr Leu Met Ile Leu Arg Leu Lys Ser Val Arg Leu Leu Ser
245 250 255

Gly Ser Arg Glu Lys Asp Arg Asn Leu Arg Arg Ile Thr Lys Leu Val
260 265 270

Leu Val Val Val Ala Val Phe Ile Ile Cys Trp Thr Pro Ile His Ile
275 280 285

Phe Ile Leu Val Glu Ala Leu Gly Ser Thr Ser His Ser Thr Ala Ala
290 295 300

Leu Ser Ser Tyr Tyr Phe Cys Ile Ala Leu Gly Tyr Thr Asn Ser Ser
305 310 315 320

Leu Asn Pro Val Leu Tyr Ala Phe Leu Asp Glu Asn Phe Lys Arg Cys
325 330 335

Phe Arg Asp Phe Cys Phe Pro Ile Lys Met Arg Met Glu Arg Gln Ser
340 345 350

Thr Asn Arg Val Arg Asn Thr Val Gln Asp Pro Ala Ser Met Arg Asp
355 360 365

Val Gly Gly Met Asn Lys Pro Val 
370 375 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1805 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 10..1119

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

GGCAGTGGC ATG GAG CCC CTC TTC CCC GCG CCG TTC TGG GAG GTT ATC 48
Met Glu Pro Leu Phe Pro Ala Pro Phe Trp Glu Val Ile
375 380 385

TAC GGC AGC CAC CTT CAG GGC AAC CTG TCC CTC CTG AGC CCC AAC CAC 96
Tyr Gly Ser His Leu Gln Gly Asn Leu Ser Leu Leu Ser Pro Asn His
390 395 400

AGT CTG CTG CCC CCG CAT CTG CTG CTC AAT GCC AGC CAC GGC GCC TTC 144
Ser Leu Leu Pro Pro His Leu Leu Leu Asn Ala Ser His Gly Ala Phe
405 410 415

CTG CCC CTC GGG CTC AAG GTC ACC ATC GTG GGG CTC TAC CTG GCC GTG 192
Leu Pro Leu Gly Leu Lys Val Thr Ile Val Gly Leu Tyr Leu Ala Val
420 425 430

TGT GTC GGA GGG CTC CTG GGG AAC TGC CTT GTC ATG TAC GTC ATC CTC 240
Cys Val Gly Gly Leu Leu Gly Asn Cys Leu Val Met Tyr Val Ile Leu
435 440 445

AGG CAC ACC AAA ATG AAG ACA GCC ACC AAT ATT TAC ATC TTT AAC CTG 288
Arg His Thr Lys Met Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu
450 455 460 465

GCC CTG GCC GAC ACT CTG GTC CTG CTG ACG CTG CCC TTC CAG GGC ACG 336
Ala Leu Ala Asp Thr Leu Val Leu Leu Thr Leu Pro Phe Gln Gly Thr
470 475 480

GAC ATC CTC CTG GGC TTC TGG CCG TTT GGG AAT GCG CTG TGC AAG ACA 384
Asp Ile Leu Leu Gly Phe Trp Pro Phe Gly Asn Ala Leu Cys Lys Thr
485 490 495

GTC ATT GCC ATT GAC TAC TAC AAC ATG TTC ACC AGC ACC TTC ACC CTA 432
Val Ile Ala Ile Asp Tyr Tyr Asn Met Phe Thr Ser Thr Phe Thr Leu
500 505 510

ACT GCC ATG AGT GTG GAT CGC TAT GTA GCC ATC TGC CAC CCC ATC CGT 480
Thr Ala Met Ser Val Asp Arg Tyr Val Ala Ile Cys His Pro Ile Arg
515 520 525

GCC CTC GAC GTC CGC ACG TCC AGC AAA GCC CAG GCT GTC AAT GTG GCC 528
Ala Leu Asp Val Arg Thr Ser Ser Lys Ala Gln Ala Val Asn Val Ala
530 535 540 545

ATC TGG GCC CTG GCC TCT GTT GTC GGT GTT CCC GTT GCC ATC ATG GGC 576
Ile Trp Ala Leu Ala Ser Val Val Gly Val Pro Val Ala Ile Met Gly
550 555 560

TCG GCA CAG GTC GAG GAT GAA GAG ATC GAG TGC CTG GTG GAG ATC CCT 624
Ser Ala Gln Val Glu Asp Glu Glu Ile Glu Cys Leu Val Glu Ile Pro
565 570 575

ACC CCT CAG GAT TAC TGG GGC CCG GTG TTT GCC ATC TGC ATC TTC CTC 672
Thr Pro Gln Asp Tyr Trp Gly Pro Val Phe Ala Ile Cys Ile Phe Leu
580 585 590

TTC TCC TTC ATC GTC CCC GTG CTC GTC ATC TCT GTC TGC TAC AGC CTC 720
Phe Ser Phe Ile Val Pro Val Leu Val Ile Ser Val Cys Tyr Ser Leu
595 600 605
ATG ATC CGG CGG CTC CGT GGA GTC CGC CTG CTC TCG GGC TCC CGA GAG 768
Met Ile Arg Arg Leu Arg Gly Val Arg Leu Leu Ser Gly Ser Arg Glu
610 615 620 625

AAG GAC CGG AAC CTG CGG CGC ATC ACT CGG CTG GTG CTG GTG GTA GTG 816
Lys Asp Arg Asn Leu Arg Arg Ile Thr Arg Leu Val Leu Val Val Val
630 635 640

GCT GTG TTC GTG GGC TGC TGG ACG CCT GTC CAG GTC TTC GTG CTG GCC 864
Ala Val Phe Val Gly Cys Trp Thr Pro Val Gln Val Phe Val Leu Ala
645 650 655

CAA GGG CTG GGG GTT CAG CCG AGC AGC GAG ACT GCC GTG GCC ATT CTG 912
Gln Gly Leu Gly Val Gln Pro Ser Ser Glu Thr Ala Val Ala Ile Leu
660 665 670

CGC TTC TGC ACG GCC CTG GGC TAC GTC AAC AGC TGC CTC AAC CCC ATC 960
Arg Phe Cys Thr Ala Leu Gly Tyr Val Asn Ser Cys Leu Asn Pro Ile
675 680 685

CTC TAC GCC TTC CTG GAT GAG AAC TTC AAG GCC TGC TTC CGC AAG TTC 1008
Leu Tyr Ala Phe Leu Asp Glu Asn Phe Lys Ala Cys Phe Arg Lys Phe
690 695 700 705

TGC TGT GCA TCT GCC CTG CGC CGG GAC GTG CAG GTG TCT GAC CGC GTG 1056
Cys Cys Ala Ser Ala Leu Arg Arg Asp Val Gln Val Ser Asp Arg Val
710 715 720

CGC AGC ATT GCC AAG GAC GTG GCC CTG GCC TGC AAG ACC TCT GAG ACG 1104
Arg Ser Ile Ala Lys Asp Val Ala Leu Ala Cys Lys Thr Ser Glu Thr
725 730 735

GTA CCG CGG CCC GCA TGACTAGGCG TGGACCTGCC CATGGTGCCT GTCAGCCCGC 1159
Val Pro Arg Pro Ala
740

AGAGCCCATC TACGCCCAAC ACAGAGCTCA CACAGGTCAC TGCTCTCTAG GCGGACACAC 1219 

CCTGGGCCCT GAGCATCCAG AGCCTGGGAT GGGCTTTTCC CTGTGGGCCA GGGATGCTCG 1279 

GTCCCAGAGG AGGACCTAGT GACATCATGG GACAGGTCAA AGCATTAGGG CCACCTCCAT 1339 

GGCCCCAGAC AGACTAAAGC TGCCCTCCTG GTGCAGGGCC GAGGGGACAC AAGGACCTAC 1399

CTGGAAGCAG CTGACATGCT GGTGGACGGC CGTTACTGGA GCCCGTGCCC CTCCCTCCCC 1459 

GTGCTTCATG TGACTCTTGG CCTCTCTGCT GCTGCGTTGG CAGAACCCTG GGTGGGCAGG 1519 

CACCCGGAGG AGGAGCAGCA GCTGTGTCAT CCTGTGCCCC CCATGTGCTG TGTGCTGTTT 1579 

GCATGGCAGG GCTCCAGCTG CCTTCAGCCC TGTGACGTCT CCTCAGGGCA GCTGGACAGG 1639

CTTGGCACGG CCCGGGAAGT GCAGCAGGCA GCTTTTCTTT GGGGTGGGAC TTGCCCTGAG 1699 

CTTGGAGCTG CCACCTGGAG GACTTGCCTG TTCCGACTCC ACCTGTGCAG CCGGGGCCAC 1759 

CCCAGGAGAA AGTGTCCAGG TGGGGGCTGG CAGTCCCTGG CTGCAG 1805 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Glu Pro Leu Phe Pro Ala Pro Phe Trp Glu Val lie Tyr Gly Ser 

'l 5 . 10 15 

His Leu Gin Gly Asn Leu Ser Leu Leu Ser Pro Asn His Ser Leu Leu 
20 25 30 

Pro Pro His Leu Leu Leu Asn Ala Ser His Gly Ala Phe Leu Pro Leu 
35 40 45 

Gly Leu Lys Val Thr lie Val Gly Leu Tyr Leu Ala Val Cys Val Gly 
50 55 60 

Gly Leu Leu Gly Asn Cys Leu Val Met Tyr Val lie Leu Arg His Thr 
65 70 75 80 

Lys Met Lys Thr Ala Thr Asn lie Tyr lie Phe Asn Leu Ala Leu Ala 
85 90 95 

Asp Thr Leu Val Leu Leu Thr Leu Pro Phe Gin Gly Thr Asp lie Leu 
100 105 110 

Leu Gly Phe Trp Pro Phe Gly Asn Ala Leu Cys Lys Thr Val Ile Ala
115 120 125

Ile Asp Tyr Tyr Asn Met Phe Thr Ser Thr Phe Thr Leu Thr Ala Met
130 135 140

Ser Val Asp Arg Tyr Val Ala Ile Cys His Pro Ile Arg Ala Leu Asp
145 150 155 160

Val Arg Thr Ser Ser Lys Ala Gln Ala Val Asn Val Ala Ile Trp Ala
165 170 175

Leu Ala Ser Val Val Gly Val Pro Val Ala Ile Met Gly Ser Ala Gln
180 185 190

Val Glu Asp Glu Glu Ile Glu Cys Leu Val Glu Ile Pro Thr Pro Gln
195 200 205

Asp Tyr Trp Gly Pro Val Phe Ala Ile Cys Ile Phe Leu Phe Ser Phe
210 215 220

Ile Val Pro Val Leu Val Ile Ser Val Cys Tyr Ser Leu Met Ile Arg
225 230 235 240

Arg Leu Arg Gly Val Arg Leu Leu Ser Gly Ser Arg Glu Lys Asp Arg 
245 250 255 

Asn Leu Arg Arg Ile Thr Arg Leu Val Leu Val Val Val Ala Val Phe
260 265 270

Val Gly Cys Trp Thr Pro Val Gln Val Phe Val Leu Ala Gln Gly Leu
275 280 285

Gly Val Gln Pro Ser Ser Glu Thr Ala Val Ala Ile Leu Arg Phe Cys
290 295 300

Thr Ala Leu Gly Tyr Val Asn Ser Cys Leu Asn Pro Ile Leu Tyr Ala
305 310 315 320

Phe Leu Asp Glu Asn Phe Lys Ala Cys Phe Arg Lys Phe Cys Cys Ala
325 330 335

Ser Ala Leu Arg Arg Asp Val Gln Val Ser Asp Arg Val Arg Ser Ile
340 345 350

Ala Lys Asp Val Ala Leu Ala Cys Lys Thr Ser Glu Thr Val Pro Arg 
355 360 365 

Pro Ala 
370 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: group (9, 12, 33, 40, 48)

(D) OTHER INFORMATION: /note= "Extracellular Asn residues
that are consensus sites for N-linked glycosylation"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Asp Ser Ser Ala Ala Pro Thr Asn Ala Ser Asn Cys Thr Asp Ala 
1 5 10 15 

Leu Ala Tyr Ser Ser Cys Ser Pro Ala Pro Ser Pro Gly Ser Trp Val 
20 25 30 

Asn Leu Ser His Leu Asp Gly Asn Leu Ser Asp Pro Cys Gly Pro Asn 
35 40 45 

Arg Thr Asn Leu Gly Gly Arg Asp Ser Leu Cys Pro Pro Thr Gly Ser 
50 55 60 

Pro Ser Met Ile Thr Ala Ile Thr Ile Met Ala Leu Tyr Ser Ile Val
65 70 75 80

Cys Val Val Gly Leu Phe Gly Asn Phe Leu Val Met Tyr Val Ile Val
85 90 95

Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu
100 105 110

Ala Leu Ala Asp Ala Leu Ala Thr Ser Thr Leu Pro Phe Gln Ser Val
115 120 125

Asn Tyr Leu Met Gly Thr Trp Pro Phe Gly Thr Ile Leu Cys Lys Ile
130 135 140

Val Ile Ser Ile Asp Tyr Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu
145 150 155 160

Cys Thr Met Ser Val Asp Arg Tyr Ile Ala Val Cys His Pro Val Lys
165 170 175

Ala Leu Asp Phe Arg Thr Pro Arg Asn Ala Lys Ile Ile Asn Val Cys
180 185 190

Asn Trp Ile Leu Ser Ser Ala Ile Gly Leu Pro Val Met Phe Met Ala
195 200 205

Thr Thr Lys Tyr Arg Gln Gly Ser Ile Asp Cys Thr Leu Thr Phe Ser
210 215 220

His Pro Thr Trp Tyr Trp Glu Asn Leu Val Lys Ile Cys Val Phe Ile
225 230 235 240

Phe Ala Phe Ile Met Pro Val Leu Ile Ile Thr Val Cys Tyr Gly Leu
245 250 255

Met Ile Leu Arg Leu Lys Ser Val Arg Met Leu Ser Gly Ser Lys Glu
260 265 270

Lys Asp Arg Asn Leu Arg Arg Ile Thr Arg Met Val Leu Val Val Val
275 280 285

Ala Val Phe Ile Val Cys Trp Thr Pro Ile His Ile Tyr Val Ile Ile
290 295 300

Lys Ala Leu Val Thr Ile Pro Glu Thr Thr Phe Gln Thr Val Ser Trp
305 310 315 320

His Phe Cys Ile Ala Leu Gly Tyr Thr Asn Ser Cys Leu Asn Pro Val
325 330 335

Leu Tyr Ala Phe Leu Asp Glu Asn Phe Lys Arg Cys Phe Arg Glu Phe
340 345 350

Cys Ile Pro Thr Ser Ser Asn Ile Glu Gln Gln Asn Ser Thr Arg Ile
355 360 365

Arg Gln Asn Thr Arg Asp His Pro Ser Thr Ala Asn Thr Val Asp Arg
370 375 380

Thr Asn His Gln Leu Glu Asn Leu Glu Ala Glu Thr Ala Pro Leu Pro
385 390 395 400

Phe Ile Gly Phe Ile Arg Ser Thr Ser Glu Gln Glu Asn Cys Glu
405 410 415 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: group (18, 33)

(D) OTHER INFORMATION: /note= "Extracellular Asn residues
that are consensus sites for N-linked glycosylation"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Glu Pro Ala Pro Ser Ala Gly Ala Glu Leu Gln Pro Pro Leu Phe
1 5 10 15

Ala Asn Ala Ser Asp Ala Tyr Pro Ser Ala Cys Pro Ser Ala Gly Ala
20 25 30

Asn Ala Ser Gly Pro Pro Ala Arg Ser Ala Ser Ser Leu Ala Leu Ala
35 40 45

Ile Ala Ile Thr Ala Leu Tyr Ser Ala Val Cys Ala Val Gly Leu Ile
50 55 60

Gly Asn Val Leu Val Met Phe Gly Ile Val Arg Tyr Thr Lys Met Lys
65 70 75 80

Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu Ala Leu Ala Asp Ala Leu
85 90 95

Ala Thr Ser Thr Leu Pro Phe Gln Ser Ala Lys Tyr Leu Met Glu Thr
100 105 110

Trp Pro Phe Gly Glu Leu Leu Cys Lys Ala Val Leu Ser Ile Asp Tyr
115 120 125

Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu Thr Met Met Ser Val Asp
130 135 140

Arg Tyr Ile Ala Val Cys His Pro Val Lys Ala Leu Asp Phe Arg Thr
145 150 155 160

Pro Ala Lys Ala Lys Leu Ile Asn Ile Cys Ile Trp Val Leu Ala Ser
165 170 175

Gly Val Gly Val Pro Ile Met Val Met Ala Val Thr Arg Pro Arg Asp
180 185 190

Gly Ala Val Val Cys Met Leu Gln Phe Pro Ser Pro Ser Trp Tyr Trp
195 200 205

Asp Thr Val Thr Lys Ile Cys Val Phe Leu Phe Ala Phe Val Val Pro
210 215 220

Ile Leu Ile Ile Thr Val Cys Tyr Gly Leu Met Leu Leu Arg Leu Arg
225 230 235 240

Ser Val Arg Leu Leu Ser Gly Ser Lys Glu Lys Asp Arg Ser Leu Arg
245 250 255

Arg Ile Thr Arg Met Val Leu Val Val Val Gly Ala Phe Val Val Cys
260 265 270

Trp Ala Pro Ile His Ile Phe Val Ile Val Trp Thr Leu Val Asp Ile
275 280 285

Arg Arg Asp Pro Leu Val Val Ala Ala Leu His Leu Cys Ile Ala Leu
290 295 300

Gly Tyr Ala Asn Ser Ser Leu Asn Pro Val Leu Tyr Ala Phe Leu Asp
305 310 315 320

Glu Asn Phe Lys Arg Cys Phe Arg Gln Leu Cys Arg Lys Pro Cys Gly
325 330 335

Arg Pro Asp Pro Ser Ser Phe Ser Arg Ala Arg Glu Ala Thr Ala Arg
340 345 350

Glu Arg Val Thr Ala Cys Thr Pro Ser Asp Gly Pro Gly Gly Gly Ala
355 360 365

Ala Ala
370

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: group (25, 39)

(D) OTHER INFORMATION: /note= "Extracellular Asn residues
that are consensus sites for N-linked glycosylation"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Glu Ser Pro Ile Gln Ile Phe Arg Gly Asp Pro Gly Pro Thr Cys
1 5 10 15

Ser Pro Ser Ala Cys Leu Leu Pro Asn Ser Ser Ser Trp Phe Pro Asn
20 25 30

Trp Ala Glu Ser Asp Ser Asn Gly Ser Val Gly Ser Glu Asp Gln Gln
35 40 45

Leu Glu Ser Ala His Ile Ser Pro Ala Ile Pro Val Ile Ile Thr Ala
50 55 60

Val Tyr Ser Val Val Phe Val Val Gly Leu Val Gly Asn Ser Leu Val
65 70 75 80

Met Phe Val Ile Ile Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn Ile
85 90 95

Tyr Ile Phe Asn Leu Ala Leu Ala Asp Ala Leu Val Thr Thr Thr Met
100 105 110

Pro Phe Gln Ser Ala Val Tyr Leu Met Asn Ser Trp Pro Phe Gly Asp
115 120 125

Val Leu Cys Lys Ile Val Ile Ser Ile Asp Tyr Tyr Asn Met Phe Thr
130 135 140

Ser Ile Phe Thr Leu Thr Met Met Ser Val Asp Arg Tyr Ile Ala Val
145 150 155 160

Cys His Pro Val Lys Ala Leu Asp Phe Arg Thr Pro Leu Lys Ala Lys 
165 170 175 

Ile Ile Asn Ile Cys Ile Trp Leu Leu Ala Ser Ser Val Gly Ile Ser
180 185 190

Ala Ile Val Leu Gly Gly Thr Lys Val Arg Glu Asp Val Ile Glu Cys
195 200 205

Ser Leu Gln Phe Pro Asp Asp Glu Trp Trp Asp Leu Phe Met Lys Ile
210 215 220

Cys Val Phe Val Phe Ala Phe Val Ile Pro Val Leu Ile Ile Ile Val
225 230 235 240

Cys Tyr Thr Leu Met Ile Leu Arg Leu Lys Ser Val Arg Leu Leu Ser
245 250 255

Gly Ser Arg Glu Lys Asp Arg Asn Leu Arg Arg Ile Thr Lys Leu Val
260 265 270

Leu Val Val Val Ala Val Phe Ile Ile Cys Trp Thr Pro Ile His Ile
275 280 285

Phe Ile Leu Val Glu Ala Leu Gly Ser Thr Ser His Ser Thr Ala Ala
290 295 300

Leu Ser Ser Tyr Tyr Phe Cys Ile Ala Leu Gly Tyr Thr Asn Ser Ser
305 310 315 320

Leu Asn Pro Val Leu Tyr Ala Phe Leu Asp Glu Asn Phe Lys Arg Cys
325 330 335

Phe Arg Asp Phe Cys Phe Pro Ile Lys Met Arg Met Glu Arg Gln Ser
340 345 350

Thr Asn Arg Val Arg Asn Thr Val Gln Asp Pro Ala Ser Met Arg Asp
355 360 365

Val Gly Gly Met Asn Lys Pro Val Glu Gln Met His Glu Lys Ser Ile
370 375 380

Pro Tyr Ser Gln Glu Thr Leu Val Val Asp
385 390 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: group (19, 26, 37)

(D) OTHER INFORMATION: /note= "Extracellular Asn residues
that are consensus sites for N-linked glycosylation"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Met Glu Pro Leu Phe Pro Ala Pro Phe Trp Ile Tyr Gly Ser His Leu
1 5 10 15

Gln Gly Asn Leu Ser Leu Leu Ser Pro Asn His Ser Leu Leu Pro Pro
20 25 30 

His Leu Leu Leu Asn Ala Ser His Gly Ala Leu Pro Leu Gly Leu Lys 
35 40 45 

Val Thr Ile Val Gly Leu Tyr Leu Ala Val Cys Val Gly Gly Leu Leu
50 55 60 

Gly Asn Cys Leu Val Met Tyr Val Leu Leu Arg His Thr Lys Met Lys 
65 70 75 80 

Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu Ala Leu Ala Asp Thr Leu
85 90 95

Val Leu Leu Thr Leu Pro Phe Gln Gly Thr Asp Ile Leu Leu Gly Pro
100 105 110

Trp Pro Phe Gly Asn Ala Leu Cys Lys Thr Val Ile Ala Ile Asp Tyr
115 120 125

Tyr Asn Met Phe Thr Ser Thr Phe Thr Leu Thr Ala Met Ser Val Asp
130 135 140

Arg Tyr Val Ala Ile Cys His Pro Ile Arg Ala Leu Asp Val Arg Thr
145 150 155 160

Ser Ser Lys Ala Gln Ala Val Asn Val Ala Ile Trp Ala Leu Ala Ser
165 170 175

Val Val Gly Val Pro Val Ala Ile Met Gly Ser Ala Gln Val Glu Asp
180 185 190

Glu Glu Ile Glu Cys Leu Val Glu Ile Pro Thr Pro Gln Asp Tyr Trp
195 200 205

Gly Pro Val Phe Ala Ile Cys Ile Phe Leu Phe Ser Phe Ile Val Pro
210 215 220

Val Leu Val Ile Ser Val Cys Tyr Ser Leu Met Ile Arg Arg Leu Arg
225 230 235 240 

Gly Val Arg Leu Leu Ser Gly Ser Arg Glu Lys Asp Arg Asn Leu Arg 
245 250 255 

Arg Ile Thr Arg Leu Val Leu Val Val Val Ala Val Phe Val Gly Cys
260 265 270

Trp Thr Pro Val Gln Val Phe Val Leu Ala Gln Gly Leu Gly Val Gln
275 280 285

Pro Ser Ser Glu Thr Ala Val Ala Ile Leu Arg Phe Cys Thr Ala Leu
290 295 300

Gly Tyr Val Asn Ser Cys Leu Asn Pro Ile Leu Tyr Ala Phe Leu Asp
305 310 315 320

Glu Asn Phe Lys Ala Cys Phe Arg Lys Phe Cys Cys Ala Ser Ala Leu
325 330 335

Arg Arg Asp Val Gln Val Ser Asp Arg Val Arg Ser Ile Ala Lys Asp
340 345 350 

Val Ala Leu Ala Cys Lys Thr Ser Glu Thr Val Pro Arg Pro Ala 
355 360 365 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 15

(D) OTHER INFORMATION: /note= "Extracellular Asn residue
which is a consensus site for N-linked glycosylation"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Glu Glu Gly Gly Asp Phe Asp Asn Tyr Tyr Gly Ala Asp Asn Gln
1 5 10 15

Ser Glu Cys Glu Tyr Thr Asp Trp Lys Ser Ser Gly Ala Leu Ile Pro
20 25 30

Ala Ile Tyr Met Leu Val Phe Leu Leu Gly Thr Thr Gly Asn Gly Leu
35 40 45

Val Leu Trp Thr Val Phe Arg Ser Ser Arg Lys Arg Arg Ser Ala Asp
50 55 60

Ile Phe Ile Ala Ser Leu Ala Val Ala Asp Leu Thr Phe Val Val Thr
65 70 75 80 

Leu Pro Leu Ala Thr Tyr Thr Tyr Arg Asp Tyr Asp Trp Pro Phe Gly 
85 90 95 

Thr Phe Phe Cys Lys Leu Ser Ser Tyr Leu Ile Phe Val Asn Met Tyr
100 105 110 

Ala Ser Val Phe Cys Leu Thr Gly Leu Ser Phe Asp Arg Tyr Leu Ala 
115 120 125 

Ile Val Arg Pro Val Ala Asn Ala Arg Leu Arg Leu Arg Val Ser Gly
130 135 140

Ala Val Ala Thr Ala Val Leu Trp Val Leu Ala Ala Leu Leu Ala Met 
145 150 155 160 

Pro Val Met Val Leu Thr Thr Gly Asp Leu Glu Asn Thr Thr Val Gln
165 170 175 

Cys Tyr Met Asp Tyr Ser Ser Ser Glu Trp Ala Trp Glu Val Gly Leu 
180 185 190 

Gly Val Ser Ser Thr Thr Val Gly Phe Val Val Pro Phe Thr Ile Met
195 200 205

Leu Thr Cys Tyr Phe Phe Ile Ala Gln Thr Ile Ala Gly His Phe Arg
210 215 220

Lys Glu Arg Ile Glu Gly Leu Arg Lys Arg Arg Arg Leu Leu Ser Ile
225 230 235 240

Ile Val Val Leu Val Val Thr Phe Ala Leu Cys Trp Met Pro Tyr His
245 250 255

Leu Tyr Met Leu Gly Ser Leu Leu His Trp Pro Cys Asp Asp Leu Phe
260 265 270

Leu Met Asn Ile Phe Pro Tyr Cys Thr Cys Ile Ser Tyr Val Asn Ser
275 280 285

Cys Leu Asn Pro Phe Leu Tyr Ala Phe Phe Asp Pro Arg Phe Arg Ala
290 295 300

Cys Thr Ser Met Leu Cys Cys Gly Gln Ser Arg Cys Ala Gly Thr Ser
305 310 315 320 

His Ser Ser Ser Gly Glu Lys Ser Ala Ser Tyr Ser Ser Gly His Ser 
325 330 335 

Gln Gly Pro Gly Pro Asn Met Gly Lys Gly Gly
340 345

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "Any amino acid" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Tyr Gly Gly Phe Xaa 
1 5 



